Genome Medicine
Top medRxiv preprints most likely to be published in this journal, ranked by match strength.
Show abstract
MotivationFanconi anemia (FA) is a rare disease mainly caused by biallelic pathogenic variants, including structural variants such as large deletions and insertions in FA genes. Currently, variant detection is based on short-read sequencing and probe-based approaches. However, determining the exact genomic breakpoint or achieving allelic discrimination remains challenging. Nanopore-based long-read sequencing enables a comprehensive detection of FA variants, but a unified bioinformatic analysis p...
Show abstract
BackgroundKlebsiella pneumoniae is a common cause of neonatal sepsis in Africa, and is frequently hospital acquired. We recently reported an outbreak of multidrug-resistant K. pneumoniae sepsis amongst neonates at a rural hospital in The Gambia, West Africa, involving 57 cases and case fatality of 60%. Here we undertook a retrospective pathogen genomic epidemiology study of clinical and environmental K. pneumoniae isolated during the outbreak, to identify the outbreak strain, refine the epidemic...
Show abstract
Rare Mendelian disorders affect 300-400 million people globally. Although genetic testing has become widely adopted, gene-specific evidence for tailored variant interpretation remains scattered across resources. We present Gene Portals, a framework for gene-centered multimodal knowledge bases that co-localize expert-harmonized clinical data, functional assays, population variation, structural annotations and gene-specific ACMG/AMP specifications within a single resource. A modular interface inte...
Show abstract
Accurate classification of BRCA1 and BRCA2 variants is essential for cancer risk assessment and therapy selection, yet over one-third remain variants of uncertain significance (VUS). Here, using 120,660 real-world cancer genomic profiles with BRCA1 or BRCA2 variants from a >800,000-sample cohort, we develop machine learning models that predict pathogenicity using clinical and tumor-derived features, including a pan-cancer homologous recombination deficiency signature, co-mutated genes, zygosity,...
Show abstract
Tumour typing from whole-genome sequencing is increasingly accurate, yet molecular subtyping from somatic variants remains challenging because of tumour heterogeneity and inconsistent clinical annotations. Here, we present Mutation-Attention Dual-Task (MuAt2), a Transformer model that jointly classifies histological tumour types and subtypes directly from somatic single-nucleotide variants, indels and structural variants. MuAt2 leverages encoders pre-trained on 2,587 pan-cancer whole genomes, an...
Show abstract
We report a previously undescribed genotypic configuration identified in twins with HNRNPU-related neurodevelopmental disorder. Both twins have two closely spaced mosaic variants on the same allele that never co-occur on any single DNA molecule, resulting in three distinct cell lineages within each individual. We define this genotypic configuration as clustered monoallelic mosaicism (cMoMa). Recognizing the extreme improbability of such a configuration, we systematically explore two potential me...
Show abstract
Chromosome 5p15.33 harbors several independent association signals which demonstrate antagonistic pleiotropy across cancer types, with causal mechanisms largely unresolved. To identify functional variants and enhancer elements at this locus, we performed statistical fine-mapping followed by massively parallel reporter assays (MPRA) and proliferation based CRISPRi screens. This approach identified eight multi-cancer functional variants (MCFVs) across three GWAS signals. Targeting rs421629 (part o...
Show abstract
Type 2 diabetes (T2D) affects 11.1% of the global population, underscoring the need for biomarkers that inform treatment response and glycemic outcomes. We evaluated the association between the FTO variant rs9939609-A and glycemic control in a Mexican population. A total of 174 individuals living with T2D from Merida and Sisal, Yucatan, were included, of whom 85% were receiving oral hypoglycemic agents as main treatment. Glycemic control was defined cross-sectionally as good ([≤]130 mg/dL, n=...
Show abstract
BackgroundA coronary artery calcium (CAC) score of 0 is widely considered to indicate low short- to intermediate-term risk for coronary artery disease (CAD) and is frequently used to defer lipid-lowering therapy. However, a subset of individuals with CAC=0 still experience events, highlighting residual risk not captured by imaging alone. Polygenic risk scores (PRS) quantify lifelong inherited susceptibility, but conventional approaches rely on predefined ancestry labels despite human genetic div...
Show abstract
Linezolid is a critical last-resort antimicrobial for multidrug-resistant Enterococcus faecium, particularly against vancomycin-resistant lineages where therapeutic options are severely limited. While resistance has historically arisen through de novo chromosomal mutations, the global emergence of transferable resistance mechanisms threatens to render more infections untreatable. Here, we characterise a recent (2023-2024) hospital-associated outbreak of linezolid-resistant E. faecium in Queensla...
Show abstract
STUDY QUESTIONAre pathogenic variants in Homeodomain-interacting protein kinase (HIPK4) associated with sperm head abnormalities causing male infertility? SUMMARY ANSWERHIPK4 is a novel candidate gene associated with sperm head defects and human male infertility. WHAT IS KNOWN ALREADYNumerous genes causing male infertility due to Multiple Morphological Abnormalities of the sperm flagella (MMAF) have been described but the genetic basis of sperm head defects is less well understood. STUDY DESI...
Show abstract
BackgroundAnthracyclines are central to childhood cancer therapy but predispose patients to cardiotoxicity leading to long-term cardiovascular risk. Endothelial injury and impaired repair contribute to this, yet pediatric data remain limited. ObjectiveTo longitudinally assess endothelial injury and repair in childhood cancer patients treated with anthracyclines by quantifying circulating endothelial cells (CECs) and endothelial progenitor cells (EPCs). MethodsIn a single-centre retrospective c...
Show abstract
Anti-{beta}2-glycoprotein I (anti-{beta}2GPI) antibodies are central to the pathogenesis of antiphospholipid syndrome (APS), an autoimmune disease characterized by a strong predisposition to venous thromboembolism (VTE). In this study, we conducted a multi-ancestry genome-wide association study (GWAS) of quantitative total anti-{beta}2GPI levels in 5,969 participants enrolled in the Multi-Ethnic Study of Atherosclerosis (MESA) and identified a genome-wide significant association at the APOH locu...
Show abstract
BackgroundThe relationship between hip osteoarthritis (hip OA) and Alzheimers disease (AD) presents a critical paradox within the emerging "bone-brain axis": widespread phenotypic comorbidity sharply contradicts evolutionary theories of biological antagonism. This study integrates longitudinal and multi-omic analyses to determine whether this clinical overlap masks an underlying genetic neuroprotection. MethodsWe analyzed longitudinal phenotypic data from 261,767 UK Biobank participants using C...
Show abstract
Immune effector cell-associated neurotoxicity syndrome (ICANS) is a common and life-threatening complication of chimeric antigen receptor (CAR) T-cell therapy, with early detection being critical for timely intervention and improved outcomes. Cytokines such as interleukin-6 (IL-6) are key mediators of the inflammatory cascade underlying ICANS pathogenesis, but prospective clinical evidence for their predictive value is limited. Here we quantify IL-6 levels in a prospective cohort of 40 CAR-T pat...
Show abstract
RIG-I is a cytosolic immune receptor that provides the first line of defense by detecting viral RNA and triggering antiviral responses. Its physiological role in humans remains unclear, as no patients with complete RIG-I deficiency have yet been reported. We identified a critically ill COVID-19 patient with severe RIG-I deficiency caused by heterozygous RIG-I G731R, a novel dominant loss-of-function variant. The G731R mutation in helicase motif VI disrupts the arginine finger, impairing the ATPa...
Show abstract
Nocturnal glucose regulation is modulated by autonomic and circadian mechanisms, yet their dynamic interplay in apparently healthy, free-living populations remains poorly studied. Here, we assessed 227,860 nights of concurrent sleep data from Ultrahuman AIR ring and M1 continuous glucose monitoring (CGM) system across 5849 adults globally to examine nocturnal cardio-metabolic coupling. We found that higher sleep consistency was inversely associated with glucose variability, and vice versa. Unsup...
Show abstract
ObjectivesTo identify unique echocardiographic signatures associated with TTR+ carrier status preceding onset of cardiac amyloidosis. BackgroundCarrier status for the most common pathogenic TTR variant in the United States, Val142Ile (V142I), found in 4% of African Americans (AA) and 1% of Hispanic/Latino (H/L) individuals, confers a 40-60% lifetime risk of developing variant transthyretin amyloidosis (ATTRv), including cardiac amyloidosis (CA) and heart failure (HF). Myocardial amyloid deposit...
Show abstract
BackgroundHypertension affects over 30% of adults and is the leading risk factor for cardiovascular disease. It often presents without obvious symptoms, meaning that, although effective therapies exist, hypertension remains widely undiagnosed and insufficiently treated. Genomics-based prediction methods have shown only modest benefits for these disorders, but proteomic markers have demonstrated potential for greater predictive and clinical value. MethodsWe applied a novel machine-learning based...
Show abstract
Background Whole-genome sequencing (WGS) has improved the diagnosis of rare genetic disorders, yet interpretation of non-coding variants that affect splicing remains challenging. In silico predictions alone are insufficient, and short-read RNA sequencing may fail to capture complex or low-abundance splicing events. Targeted amplicon-based long-read RNA sequencing (Amp-LRS) offers a cost-effective approach for functional validation of candidate splice-altering variants. Methods We applied Amp-LRS...